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REPORTING CF DATES_1N THE 1967 LONDON TEST 


Data on age collected from census questionnaires or any other survey are 
subject to certain types of error. In every population there are ee who give 
inconsistent Becta avout their age or date of birth. Certain dates or ages may 
be preferred by some people. In the United States, for example, there has been 
heaping at ages 21 and 65. Over and under-reporting has also been found on data 
re: age at first marriage. There are several methods to find what the errors are 
and how to graduate or smooth the data. Given the importance of age, misreporting 


may affect many cross~tabulations where age is an important element. 


Age at census or at first marriage for the London Test, 1967, was 
calculated from answers to the question on month and year GO berth Orson Lirse 
marriage. Data from questions on date of first marriage and date of birth of the 
last born child were collected for women ever married and on a sample basis.) valk 
the data med in this analysis related to marriage or fertility represent about 
1/10 of the data collected from the London Test and about 1/3 of the data for 


_ 


women ever married collected from other tests. 


Innovations such as self-enumeration, self-coding, sampling and reporting 
of date of an event rather than completed age were tested during the 1967 census 


test. 


Reporting of age has improved from census to cengus. Heaping at ages 
ending in 0, 5 and 2 has declined.(1) | 
Some assumptions that we will attempt to prove using census test data 
can be made about age reporting: | ; 
1. Age heaping is in general less pronounced when date of birth is asked rather 
than actual age at last birthday. Heaping has been shifted, however, and 


(1) For a complete account see Tetlock, G. (1969) Canada and Zelnik (1964, 439) 
and Myers (1940, 395-415) U.S. 


preferred. digits have to be measured either in terms of preferred digits in 
which years end or in terms of individual years of age. 

The year when the census is taken may create heaping at certain ages. These 
ages would not have been over or under~reported if age at last birthday had 
been asked. (2) 

Recent events may be Weekend better, mainly when the event is fixed in 
time. A ee of this could be that non-response increases as the person 
gets older. 

When asking for a date it may be better to provide for the exact decade and 
year of birth and for the exact month of birth rather than groups of months. 
If date of birth is misreported as a consequence, other dates are likely to 
be misrevorted as well. Date of birth forms the basis for deriving age at 
first marriage and age of last born child. Age at first. marriage, for example, 
is calculated in terms of the years elapsed between date _of birth of the 
person and month and year of marriage. Misreportings of date of birth will 
necessarily lessen the accuracy of derived data. 

Once the errors are located, techniques should be developed to eliminate or 


reduce the errors. 


The data we shall analyse were derived from answers to questions in the 


1967 London:-Test schedule. Reference will be made to data from other census tests 


such as the Toronto Test and the 1968 Fall Tests. Age at census was derived from 


responses to a question on date of birth. These data were collected on a 100 per 


cent basis and were computer processed after the edit programme was completed. 


Date of first marriage and date of birth of the last born child were collected on 


a sample basis. From this sample data another sample was drawn and data were 


(2) The biases introduced when age at last birthday rather than date of birth is 


asked are of different nature. This paper is not an attempt to show the 
advantages of alternative wordings of the questions on age. 
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tallied manually. 


There are two evaluation studies related to age reporting. Gower's (1968) 


PR 


study on Comparative Rates of Partial Non-Response, Mail Returns on the London and 


Toronto Tests deals with response rates from a sample of documents. The study covers 
all the census questions, Tetlock's study (1969) deals specifically with the report- 
ing of age in London, 1967. Tetlock analyses and compares reporting of age in.1961, 
1966 and 1967 for the population of London by means of. several techniques one of 

them being Myers' Blended Method. The tatter oEney is helpful to clarify and expand 
points 1, 2 and 3 above. Gower's study provides some data that may clarify point 4, 


The present evaluation will cover points 1, 2, 3 and 4 mentioned above. 


New patterns in Age Reporting 


in order to test the assumption that heaping patterns have changed, the 
blending technique developed by Myers will be applied. 

Briefly, this technique derives a "blended" population which is a weighted 
sum of the persons who reported apes ending in digits 0 to 9. Table V shows the 
oe CREE Seon by single years of age used for ae analysis. 

The ee he assumption made by Myers (1942) is that the "blended" sum 
at each terminal digit should be very close to 10 per cent. Data from several U.S. 
Censuses and age distributions from Life Tables were employed by Myers in testing 
the AS of we assumption. (Myers 1942, 412-415). Deviations from 10 per cent 
indicate preference or avoidance of certain digits. 

Table I shows the measures of digit preference and avoidance as a result 
ea lvine the "blending" technique. The overall measure of over and under 
selection of particular digits in the Test census age distribution is the Index of 
Preference. This index is one half of the absolute sum of the deviations for each 


of the terminal digits. 
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: TABLE 1. Deviations from Ten Percent and 
- . Index of Preference 


Terminal digit 


Percentage 


Total aeebinus Deviation 
lea ; from 10% 

oe £ tion 
O 146,141 aal + 0.1 
1 141,942 Ks} — 0.2 
Pe P427,632 aS 0.2 
D 1455452 .O 
4 147 ,465 ne 
5 146,056 a 
6 145,969 pal 
fi 150,068 ee 
8 138,114 5) 
3 144), 150 .O 


Index of 
preference 


1,448,009 0.9 


Source: Table I was computed from Table V “Population 
by single years of age, London Census Test, 
1967". 

Preference for certain digits in Table I is shown by a positive deviation 
iron 10 per cent. Avoidance of a digit is represented in terms of a negative devia- 
tion from 10%. The greater the positive deviation from 10 per cent, the greater the 
heaping for the given digit. 

If digit preferences were manifested in terms ae terminal digits of years 
Pepaxted, we would expect that years ending in 0, 2 and 5 will be preferred. (3) 
Since the London Test took place at the end of 1967 (September) we assume that the 
digit preferred in terms of age would be 7 because people born in 1960, let us say, 
(3) Turner (1958) pointed out that since the most familiar way of counting is with 

the base ten, heaping should occur at ages ending in muitiples of ten, the next 


largest at multiples CLeEsvelas. - 2% Refer to Stanley H. Turner’s work on 
"patterns of Heaping in the Reporting of numerical Data". 
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will be 7 years old if born between January and September 1960. Preference for 
years ending in 2 =e be manifested at ages ending in 5. Preference for years 
ending in 5 would be manifested by heaping at ages ending in 2. 

This, at least forvheaping at ages ending in 7, is evident from Table lI. 
The subject, however, as we will see later, is still open to discussion. 

The index ae preference for London, 1967, is only 0.9. This indicates 
that heaping is not a serious problem in this case. The "blending" technique tials 
been used widely and the results presented here may ean ee to the heaping shown 
in data from other censuses. In Table VII (appendix) some ni@enees of digit pré- 
ference for different countries are given. It should be noted that in obtaining the 
indices of preference shown in Table VII ages under 10 were excluded, as well as ages 
90 and over. 

Te Getclear from Table 1 that heaping occurred at ages ending in’7, ' To 
counter-balance preference for digit 7 under-reporting or avoidance of ages ending 
in 8 is also shown. No other significant patterns can be determined from this measure 
of heaping. henene at ages ending in 5 is minimal. 

Differences in "heaping" by sex were also found. The ress followed to find 
the index of preference are given in the appendix, Tables Vi and VIII. Although heap- 
ing is more pronounced for males than for females this is not very meaningful because 
we cannot determine the sex of the person who filled the questionnaire, 

The age aieendbunion My: the population by sex is illustrated in the attached 
aoe pyramidsefiget 1. -Iteis difficult to make even general assumptions from this 
pyramid about age reporting. Intervening variables such as migration, mortality or 
actually smaller or bigger cohorts must be taken into account in a more detailed 
analysis of age misreporting. The possible effect of migration can be seen by 


examining the age distribution in Table V. 
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It would be useful to find heaping or preference at individual ages, that 
is to analyse the heaping patterns by single years of age. This would permit 7 to 
know whether some ages ending in 7 are more affected than others. There are several 
techniques which could be used such as graduation, frequency distributions, etc... We 
could thus determine whether or not heaping occurred mainly at certain key ages, such 
Poriges over 50, i.e., 57, 67, Fike? and 60 on. 

Z2einik ,»for ramet ein his analysis of the 1960 U.S. Census Fipures of age 
found that (Zalnaes 1964, 439-442): 


The major exception to the general improvement in age heaping 
occurred at age 59 for both sexes. In fact, heaping at age Soy net 
characteristic of earlier censuses, is the outstanding example of 
age preference in 1960. Age heaping errors at 59 are three times 
greater than at any earlier age among females, and four times greater 
than at any earlier age among males. The very pronounced heaping 
on age 59 is undoubtedly the result of the different age question 
asked in the 1960 census. Persons tabulated at age 59 in 1960 were 
those reporting their year of birth as 1900. The preference for 
the double-zero year apparently resulted in an overstatement Od 
1900 as year of birth which, when translated into age, resulted in 
the strong preference for age 59. - 

The argument of "year heaping"! rather than "ape heaping" cannot 
explain all of the changes in age preferences between 1950 and 1960, 
‘as illustrated by the avoidance in 1960'of most of the ages ending 
in 9 (i.e., for years of birth ending im 0, a digit which, in the 
past, has been heavily preferred), The preference on the average 
for 9 in 1960 indicated by Myers! "blended method" is almost solely 
a result of the heavy heaping on age 59. The phenomenon of age 
heaping and its measurement reflect more than the simple choosing 
of one digit and the avoidance of another, In general, however, 
apparent changes in age preferences between 1950 and 1960 presumably 
arise mainly from the change in age question and are, in effect, the 
result of continuations of past digital preferences. 


Assumption 2 has been partly proven, Since the 1971 census will be taken 


in’ June this may eliminate part of the effect of heaping. 


There is no evidence from the results of Gower's study (1968) that recent 
events are reported better (date of first marriage 4s opposed to date of birth). We 


must keep in mind, however, that due to the importance of the question on date of birth 


follow-up is more strict for this question. 
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TABLE Il. Partial Non-Response on Mail Returns, Tests Documénts 


Success 


Fail 
Or 
blank 


Follow- 
u 


Question description Initial 


Total 
Total 


London 


Meretten birth date ....... 
Coded month of birth . 
Coded decade of birth 
Coded year of birth . 

Written date of marriage 
Coded month of marriage 
Coded decade of marriage 
Coded year of marriage 

Momber of births .......<- 

Written birthdate of child 
Coded month of birth ... 
Coded decade of birth 
Coded year of birth 


Written birth date ....... of Bas ogee 256 2.4 Os5 
Coded month of birth ... ge) a2 POLO ey 2 1.4 ths 
Coded decade of birth rey A Se Wane a eer ae ies) 0.3 
Coded year of birth .8 =) SiO ea) eS 0.2 

Written date of marriage aa Picpiuaren sgl 6.6 Sls yey S 
‘Coded month of marriage fe) 29 C2 eG at 50! 4,1 
Coded decade of marriage sf LO s\peo tok Nhe aS ape) AS) 
Coded year of marriage i) Po OU LO nu) 50 sari 

mips Ore biTene .. nes. >> ne EG Beer We) oe 5.6 Br8 

Written birthdate of child sa SUE gest ami Wh heal ree Dae 4.9 
Coded month of birth we POT Cae ie soinre See 4.9 
Coded decade of birth . AL Soh) peg sl sie Be eel any: 4,9 
Coded year of birth .... me OMe a) See NL Oe) ue 4.9 


Miieineiided in Po. 

(2) Included in-P50. . 
(3) Included in P53. 

Source: This is part of Table 1. in Gower's (1968, 4) study. 


; f5u an ise i 
ny t Punad crrtn bas fiers Z 


i i 
my SG 


leveTia aa 
7! ae 
' Fhe Tr 
fye¢ a 
v Pats ey” 


: oe 
: i — = ————— er 
* i c% a oom ‘» 6S bo5 ong zi > as OF of 
| | sg aa 
3 ‘ — 4] Ca Beet 4 3 wth 4 o¢ < Ta oe 
s* A. ¢ 2 ad th eS | 7 ss vt . i : 3 ‘i -_ 


It is difficult to draw fanciiesous from Table II. The conditions and the 
populations enumerated were different in both London and Toronto. Gower (1968, 2) 
made the remark that "since we are only studying mail returns, and since the rate of 
mail response was much lower in Toronto than in London, the Toronto returns probably 
ent a more select (co-operative) group than do the London''. On the other hand 
the questions on age and on the other two dates are very similar in both tests. 

Making some internal comparisons of Gower's data we find that in toeden 
the manual editing of data was higher for questions 5 aha Porthan fer E50 vor VEZ 
which are the other two questions on age. This means that the Peers coded 
better P50 and P52. This could be explained because the later questions are answered 
at the end of the questionnaire and presumably the respondent knows better~how to 
Pode at this stage, in Other words, the practice effect is being felt. However, we 
could assume that non-response is higher by the end of the questionnaire because of 
the fatigue inherent to such a long task. It is possible thatithesceason ior better 
self coding in questions P50 an P52 is due to the way the coding positions were 
set up. Every month of the year was preceded by an entry while for the coding of 
date of birth months were grouped. This pattern under the column Edit is not observed 
for Toronto. The 1969 Trial Census will allow for the respondent to code his exact 
month of birth as well as the decade and the year and this aspect could be explored 
further. 

It seems that response rates are sensitive to the way the question is worded, 
In Toronto for example in form 2B (maxi Toronto) the respondent was asked to first 
code the date of birth and at the end of the question to write the date, In form 2A 


(mini Toronto) and the 1967 London Test form the question was reversed, first the 


write-in and second the coded entry. 
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Gower's study shows a eee response rate for the London Test and the 
Mini Toronto (Form 2A) where the initial success rate (Resp.) for P5 and P6 was over 
90%. In the case of Maxi Toronto the initial success rate (Resp.) for P4 fluctuated 
- between 89%, and 91.6% (Gower, 1968, Table 1, 4). 

The figures are small and the differences in percentage do not permit 
generalizations. 


When tabulations were prepared using date of birth, date of first marriage 


and date of birth of the last born child the following rates were found for London: 


TABLE III. Non-Response to Fertility Questions by Age, London 1967 Test 


Number and percentage(1) of non-response 
Women : 


Mi P50 P51 P52(2) | 
aes Eaeeiee Date of Number of [Date of birth Ber 
[Lay Sek Ve : : : and 
ee le ee First children of last P52(3) 
marriage ever born born child 
No. he No. vs 

Pie UN le OM Cl teeta tances) < arene 29 3.8 Ss 20 2D 
15-44 6 1.4 4 3) Ware 
ebm 40 _ _ _ - 
20-24 1 1.4 1 1 1.4 
25-29 = =, = 
30-34 - — _ _ 
35-39 3 3. Z 1 M74 
40-44 Z arse | 1 1.4 

45 and over 21 yoo is 6 as 
45-49 4 ae Ss Z oa 
50-54 1 E26 D 1 si) 
55-59 3 5.4 3 i Pec 
60-64 4 8.9 4 4 8.9 
BN en ake nove hiss =o 9 ee: 8 8 6.6 
Age not stated 2 100.0 2 1 50.0 


(1) The percentage of non-response is equivalent to Column Fail or Blank in Table Il. 

(2) Includes cases when there was an entry in P51 other than none but no entry in 
P52, it also includes cases when both P51, and P52 were blank, and when the 
three P50, P51 and P52 were blank. 

(3) Cases when the 3 questions P50, P51 and P52 were blank. 


The results are close to Gower's results, “When questions P50, P51 and P52 
are compared we note the higher rate of non-response for question P51 on number of 
children ever born. ee is difficult to determine the base for P52 because when the 
question on number of children is not answered we do not know whether an answer 
should have been given in P52, The results in Table III show a higher rate of non- 
mesponse ta P52(1). In this case if P51 was blank and P52 was blank both were con- 
sidered as blank. 


The results for women 45 years old and over in a rural area are slightly 


‘different, as shown in Table IV which follows: 


TABLE IV. Non-Response to Fertility Questions by Age 


(Annapolis, Durham, Lethbridge and Napierville, 1968 Fall Tests) 


Number and percentage(1) 


Total e 
; of non-response 
women ever ; 
; Z Date of 
married Date of + 
=. eve sintgul ona 
45 years Purse = 
2 ; last born 
and over marriage fees 
child 


Gave: sliver ee (9.61.0, 6 Wie o) © 0. 6 8\ 19) 8 
at YY etait eC) Dea eat bees “ee CuO acess an Ma Se Aa 

pielisikwl io) 01.6141 6) 6 Lelied@, ¢:16 6 ie: 16) 60/6 <0. :0. 61.6 00. eee CLOLe oe! 
DL wicelietialie (et oliekst oie, ¢1.e) ¢. 0! \¢) 2) (0 8: 0. | ¥' 68 “e (8\ 16 


ele © 8 6 6) 81s 6 (pe @ 210 6 6 2 6 © 68 © 


Pie MOL. SLACEC ysis ices + canis lo viv ie o sje ow + wine 2 willy wie aie 


(1) Percentages are horizontal. 


This table shows that non-response increases with BES at least in the case 
of females ever married, 45 years old and over living in rural areas. This pattern 
asvobserved in Table Ill also. For the group of 1,237 women in rural areas the non- 
response is lower for the most recent event (date of birth of last born ehiid). A 


more careful working of se5i could yield better results. 


om 
: : . £08 
Ww tad Sa Anal Se ‘one 


Vad: Vines Spaced ~~ ta a “er? 2 


aes 


vid Petree dav 


aan 
) sea le Prey an 


dh 
» Ca _ 
¢ pois? ods Hye, ep ai 
Lie em sh. ., 2G £ 
), ager. 5 oh 
iion@ 
i] 
{ ‘ 
i = 
ms 
8 
b Pe « 
> * ut a Li 
4 @ bine 


_ 
¢ ey, Gs wt tr OFS" 2S 


oi a 7 
ts typ hoc Oper apag 


« 
aay oy “ee 
ie one 
a) _ a 
\ : Pp - * 

i heel 

: . 


ted ew 


Spas 


It would have been very useful to have complete data on age SG fiz st 
marriage for London 1967. This would have allowed us to find preferences at certain 
ages or dates of first marriage. Since the population studied Poneueted of 770 
women ever married, sample variability and sample size in this case do not) permit 
oe er analysis. 

Conclusion: 

Data summarized here clarified some of our assumptions. One point to keep 
in mind is that when age is derived from the date when the event took place ''new 
biases" may be introduced and therefore if the errors are found to be significant 
new ways of analysing the data could be developed. The type of information requested 
and the wording of questions also has an effect on response rates. 

Assumption 5 could not be evaluated. Nevertheless, it seems obvious that 
misreporting of age would affect related data and that in the case of age at first 
marriage any error may well be cumulative. Comparison of Census data on age at 
first marriage and similar data from Vital Statistics would ae a way of evaluating 
the data. 

Mr, Tetiock (1969) investigated how heaping at certain final digits has 
changed over time. We may assume that reporting of age has improved in general. We 
dealt with census test data only. Tetiock's study dealing with the test census ques- 
tion on age and with poner on of age reporting in different See ee (using the 
population of London as basis) RP tics other aspects related to the reporting of 
age and to the techniques employed to measure age heaping. 

j Digit preference is only one of the Sorat ound in age data. Other 
studies on age that could be carried out are detailed analysis of heaping by single 


years of age; response rates to age questions by age; and in the case of date of 


birth of last born child, by age at first marriage; misreporting of persons (omission) 
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in certain ages or age groups, etc. Data on age from the 1969 Tripl Census could 
be compared to data from past censuses. 

Decisions could be made on how to tabulate and how to graduate the data 
once the problem of age misreporting and its magnitude is determined. 

If assumptions 1 and 2 are correct we may expect tnat ine Loi the pies 
ferred digit in age reporting would be l, and 2 or O would be most avoided among 
the digits. Analysis of data by single years of age from the 1967 London Test 
would show whether one particular age ending in 7 was preferred over all other 


ages and what age or ages are more affected by digit preference. 
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TABLE V. Population by Single Years of Age, London Census Test 1967 


ple rece irre ——$—_——— 


Female 


Age Total Male Female Age 


LONDON CITY 201,176 | 97,697 | 103,479 


Maer er leires. oa 3,342 1,706 PRGS6 2S wane nt sacks 


2 SONGE RAR ONCOR  eee G 3,415 i000 MOD 20a sess #6 ol 
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Source: Basic Tabulations, London City Census Test, 1967 "Population", Tab}e A-/1, DBS 
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TABLE V. Population by Single Years of Age, London Census Test 1967 — Concluded 


Age Total Male Female Male | Female 
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